Exploratory Genomic Data Analysis
نویسنده
چکیده
Chapter Overview In this chapter, an introductory description of the exploration of genomic data is given. Rather than attempt an exhaustive overview of the types of genomic data and methods of analysis, the chapter focuses on one type of data, gene expression profiling by microarray technology, and one method of analysis, cluster analysis for discovering and sorting mixed populations. This type of data and method of analysis is very common in bioinformatics. It illustrates recurring problems and solutions. And a major portion of bioinformatics dealing with exploratory genomic data analysis can be viewed as a refinement and extension of this basic analysis.
منابع مشابه
PAPST, a User Friendly and Powerful Java Platform for ChIP-Seq Peak Co-Localization Analysis and Beyond
Comparative co-localization analysis of transcription factors (TFs) and epigenetic marks (EMs) in specific biological contexts is one of the most critical areas of ChIP-Seq data analysis beyond peak calling. Yet there is a significant lack of user-friendly and powerful tools geared towards co-localization analysis based exploratory research. Most tools currently used for co-localization analysi...
متن کاملCAGO: A Software Tool for Dynamic Visual Comparison and Correlation Measurement of Genome Organization
CAGO (Comparative Analysis of Genome Organization) is developed to address two critical shortcomings of conventional genome atlas plotters: lack of dynamic exploratory functions and absence of signal analysis for genomic properties. With dynamic exploratory functions, users can directly manipulate chromosome tracks of a genome atlas and intuitively identify distinct genomic signals by visual co...
متن کاملExploratory visualization of array-based comparative genomic hybridization
Recent developments in DNA microarray technology have enabled a new and highly effective platform for performing comparative genomic hybridization (CGH) measurements. CGH measures anomalies in DNA copy number. Such copy number changes are now thought to play an important role in a number of diseases, particularly cancer and developmental disorders, and may also lead to important insights releva...
متن کاملvalr: Reproducible genome interval analysis in R
New tools for reproducible exploratory data analysis of large datasets are important to address the rising size and complexity of genomic data. We developed the valr R package to enable flexible and efficient genomic interval analysis. valr leverages new tools available in the "tidyverse", including dplyr. Benchmarks of valr show it performs similar to BEDtools and can be used for interactive a...
متن کاملAnalysis of Gene Expression Profiles and Drug Activity Patterns by Clustering and Bayesian Network Learning
High-throughput genomic analysis provides insight into a complicated biological phenomena. However, the vast amount of data produced from upto-date biological experimental processes needs appropriate data mining techniques to extract useful information. In this paper, we propose a method based on cluster analysis and Bayesian network learning for the molecular pharmacology of cancer. Specifical...
متن کامل